Indexing Textual Information

Authors

  • Ioannis N. Kouris
  • Christos Makris
  • Evangelos Theodoridis
  • Athanasios K. Tsakalidis
Abstract

Information retrieval is the computational discipline that deals with the efficient representation and organization of, and access to, information objects that represent natural language texts (Baeza-Yates & Ribeiro-Neto, 1999; Salton & McGill, 1983; Witten, Moffat, & Bell, 1999). A crucial subproblem in information retrieval is the design and implementation of efficient data structures and algorithms for indexing and searching information objects that are vaguely described. In this article, we present the latest developments in the indexing area, with special emphasis on data structures and algorithmic techniques for string manipulation, space-efficient implementations, and compression techniques for the efficient storage of information objects. These problems arise in a range of applications, such as digital libraries, molecular sequence databases (DNA sequences, protein databases [Gusfield, 1997]), the implementation of Web search engines, Web mining, and information filtering.

Similar resources

Relating Graphical Features with Concept Classes for Automatic News Video Indexing

Automatic indexing of video data, especially news videos, is in strong demand considering their contents' importance and value. Various attempts have been made to index news videos automatically in order to cope with this demand, including recent challenges that utilize accompanying textual information. However, most of these methods tend to be textual information driven, which do not thoroughl...

A Retrieval System for Graphical Documents

We present a method for indexing line drawings automatically. The indexing scheme is used for the retrieval of line-drawings in a weighted information retrieval (IR) system. Being content-based, the indexing method depends not only on the graphical structures in the drawings, but on the textual entries as well. No a priori knowledge is used in the indexing scheme, since application-specific assu...

The SPIRIT Spatial Search Engine: Architecture, Ontologies and Spatial Indexing

The SPIRIT search engine provides a test bed for the development of web search technology that is specialised for access to geographical information. Major components include the user interface, geographical ontology, maintenance and retrieval functions for a test collection of web documents, textual and spatial indexes, relevance ranking and metadata extraction. Here we summarise the functiona...

Towards Cross-Media Feature Extraction

In this paper we describe past and present work dealing with the use of textual resources, out of which semantic information can be extracted in order to provide for semantic annotation and indexing of associated image or video material. Since the emergence of semantic web technologies and resources, entities, relations and events extracted from textual resources by means of Information Extract...

Normalizing Spatial Information to Improve Geographical Information Indexing and Retrieval in Digital Libraries

Our contribution is dedicated to geographic information contained in unstructured textual documents. The main focus of this article is to propose a general indexing strategy that is dedicated to spatial information, but which could be applied to temporal and thematic information as well. More specifically, we have developed a process flow that indexes the spatial information contained in textua...

Publication date: 2009